Information aggregation and synthesization system

ABSTRACT

An information aggregation and synthesization system and process. The present invention provides aggregation and packaging of structured or unstructured information from disparate sources such as those available on a network such as the Internet. A network compatible/addressable interface device is operated by a user. The network interface device communicates with local datastores or network accessible datastores via an addressing scheme such as Uniform Resource Locator addresses (URLs) utilized by the Internet. Data passing between the network interface device and the datastores is accessed, polled, and retrieved through an intermediary gateway system. Such aggregated information is then synthesized, customized, personalized and localized to meet the information resource requests specified by the user via the network interface device.

CROSS REFERENCE TO RELATED APPLICATION

This application is a continuation-in-part of U.S. patent applicationSer. No. 08/685,805 filed Jul. 24, 1996 now U.S. Pat. No. 5,901,287,which is based on Provisional Application No. 60/015,384 entitledINFORMATION AGGREGATION AND SYNTHESIZATION SYSTEM, filed Apr. 1, 1996.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention is directed to an information aggregation andsynthesization system which connects with local and network accessibledatastores through an intermediary gateway system.

2. Prior Art

Widespread use of personal computers, modems (modulator/demodulatordevices that enable data to be transmitted) and data connections hasallowed the growth of computer networks. The Internet serves as anexample of a type of computer network, and indeed, is a large network ofnetworks, all inter-connected, wherein the processing activity takesplace in real time. The Internet offers mail, file transfer, remote login and other services. The World Wide Web (WWW) is the fastest growingpart of the Internet.

On the World Wide Web (WWW), a technology called hypertext allowsInternet addressable resources to be connected, or linked, to oneanother.

In the past, certain, limited aspects, of the present invention havebeen proposed, such as monitoring of computer usage.

Lockwood (U.S. Pat. No. 5,309,355) provides a computerized tool toaugment sales and marketing capabilities of travel agency personnel. Thesystem creates and displays customized sales presentations from (1)stored client profiles; (2) travel agent assessment of client profiles;and (3) computerized reservation system responses to client profiles.Selected factors are analyzed by the operating program based upon anorganization hierarchy of specifications.

Lockwood differs from the present invention in:

1) Data sources--Lockwood uses content from both a videodisk (static)and computerized reservation systems (dynamic). The present invention iscapable of deriving content from totally dynamic sources on the WorldWide Web (including Internet and local datastores or caches simulating aWWW component).

2) Client Profiles--Lockwood proposes that these be input by a TravelAgent. In the present invention, profiles are entered by the consumer(explicit) or collected through analysis of online session activity(implicit).

3) Data Organization--Lockwood uses preindexed videodisks. The presentinvention indexes prequalified WWW sites, updating these as they changeor as users expand their WWW searches.

4) Programation--Lockwood places the entire index of information in aPROM. This index is exercised by the sequencer which displays a salespresentation. The present invention stores indices in magnetic mediumbut retrieval and presentation of the indexed information is executeddynamically on premised upon user input.

Remillard (U.S. Pat. No. 5,404,393) discloses an electronic device andmethod for monitoring television activity and communicating themonitored activity to a facility and initiating appropriate actions. Acontroller initiates an automated configuration by acquiringconfiguration information. The controller monitors television channelselection information and assembles the monitored television informationinto a user profile. An option includes capturing images or text andforwarding to the user through a mail facility.

Remillard differs from the present invention in that it suggests adevice to access distant information through a television set. Thepresent invention utilizes network addressable information resource andhuman interface elements such as those used by the Internet, one ofwhich may in fact be attached to a TV. Remillard's invention (or that ofothers) may be used as a means to acquire WWW information but does notcontemplate the present invention.

Levinson (U.S. Pat. No. 5,404,505) provides information in a databasewhich is tagged with indices to form an hierarchical structure. Softwarehaving a set of subscriber requests handling routines interacts with adata filter subsystem. The data filter subsystem receives incoming datastream and selects those packets that meet certain selection criteria. Aspecial smart caching routing is provided for anticipating futurerequests by the user.

Levinson differs from the present invention:

1) Levinson proposes a satellite based information retrieval system.This is based on fixed data sources (Compuserve, Prodigy) being queriedby a user on a telephone line with the results being returned via atelevision connection. The present invention uses a similarinfrastructure to return requested information to the user but ourprocess for identifying content that is relevant is software agent basedand retrieval of dynamic content is from the WWW vs. fixed data sources.The present invention can use any means: for example, TV, Cable Modem,RF, ISDN, Modem, fixed line (T-2, T-3 etc.).

2) Levinson would establish user inputted profiles for "Automatic DataRetrieval". The present invention supplements user provided profileinformation by constructing implicit profile recognition patterns, basedupon historical search activity.

3) Levinson's invention does not specify any of the six componentsproposed in the present invention.

Griffin et al. (U.S. Pat. No. 5,422,809) provides an information storageand retrieval system for storing, referencing and retrieving varioustravel information from a database. A querying device queries the userfor input used to define the field for the travel destination desired.Statistical records are produced which provide relevant informationrelating to travel destinations using the system. Information is thusprovided which can be used to evaluate the popularity of particulardestinations.

Griffin et al. differs from the present invention in that it discloses akiosk system and the processes and subprocesses for self service travelplanning and reservations. While the present invention provides similarcapability using other means, the six features of the present inventionare not disclosed in this patent.

Senda (U.S. Pat. No. 5,459,859) discloses an information providingsystem using a communication network which stores attribute/scheduleinformation from each subscriber and uses that information to match withother subscribers.

Senda differs from the present invention in that it is a software basedsystem for meeting a system while traveling. It involves a best fitmatch between profiles. The present invention also provides a "best fit"but between software agents and data being viewed. Senda has bothformatted selection and source data inputted for a specific purpose (tomeet someone). The present invention uses software agents to formatselection data but the source data is unformatted from the WWW.

Belove et al. (U.S. Pat. No. 5,491,820) discloses a storage transmissionmechanism for retrievable items and may be used on the Internet. Thesystem may include a filter on each client or on the server between theuser and the Internet.

Belove et al. differs from the present invention in that it is a clientserver object caching system. Except for the pruning mechanism thatlimits the information cached at the client side, there is noresemblance to the present invention.

Accordingly, it is a principal object and purpose of the presentinvention to provide an information aggregation and synthesizationprocess and system connecting a network operable device and a pluralityof local or network accessible datastores wherein data passing therebetween is accessed, polled and retrieved through an intermediarygateway system.

SUMMARY OF THE INVENTION

The present invention includes at least six different aspects orfunctional components which are related, all involving use of a computeraccessible data network such as the Internet. While the individualaspects may be utilized together, they may also be used separately.

The user initiates access to the system through a network addressableinterface device (such as a personal computer, Internet Appliance, aninteractive television or even a personal digital assistant or smarttelephone). The user is then connected to the information aggregationand synthesization system via a network service provider (most likelythrough the Internet or some variation). The user logs on to the systemeither by name, address, or with some pseudonym (or some combination).This allows the user's activity to be tracked and establishes a log ofthe user's activity during the current online experience (session). Theuser is also asked for explicit profile information concerningpreferences. These preferences will be used to narrow the informationretrieval and may be collected when the user first logs in orincrementally as the user asks for specific information. This profileinformation will be kept and updated as the individual user'spreferences change.

Once the user is logged in, the information aggregation andsynthesization system will facilitate the user's access to localinformation or information distributed on a network (this network couldbe a local area network or a wide area network such as the Internet).All user access to information is through the system.

This information is topically oriented (Germany travel, the Olympics,Spring Break or even new cars), composed of files and file referencesusing the Hypertext Markup Language ("HTML") or similar tagged referenceformat that may be prescreened for relevance and appropriateness.Selected text can be "expanded" at any time to provide otherinformation. These words are, thus, linked to other documents. Thisinformation is indexed in this fashion in advance of the user's loggingin.

A gateway is provided into the WWW for shopping while retaining the userpassing through the information aggregation and synthesization system. Agateway is provided to poll, access and retrieve information fromvarious locations. A filtering process is provided and the resultinginformation is returned to the requested party.

The user is presented with a variety of search, display and outputoptions. The search options include: 1) Search using key words orcombinations; 2) Use of complex software text search agents that havebeen predefined by the information aggregation and synthesization systemsite operators. These agents take advantage of the expansive subjectmatter expertise in understanding which search parameters will bestserve the user's search needs; 3) Use of search patterns and agents fromthis user's previous sessions, perhaps expanded by available specialsand promotions; 4) Natural Language Query; and 5) Some combination of1), 2), 3) and 4).

The user selects information to be viewed from the results of thesearch. This information is retrieved from its source and presented tothe user in the manner and at the time requested. The available displayoptions include but are not limited to: display on the user's networkcapable device, personal TV channel, customized Internet page, customCD-ROM, electronic mail, mobile devices (Personal Digital Assistants,telephones and pagers) and facsimile. Information retrieval and displaycan be text, still pictures, videos, Interactive multimedia, audio andgeographic.

In certain situations, data from the datastores destined for the user isconverted prior to delivery to the user. The data stream returned to theuser may be modified to fit the bandwidth, character set and displaylimitations of the network and may be modified to meet the limitationsof the user interface device.

Along with displays, including those for data entry, searches, searchresults, information retrieval, the user will be presented withadvertisements and/or coupons based on criteria entered by advertisers.This criteria may take the form of simple logic, linking an ad/couponwith a display or be derived from complex software text search agentsthat analyze one or more of the following: The user's looking pattern,the user's psychographic profile, the user's personal profile, theavailability of the advertiser's/couponer's goods or services at theinstant in time that the criteria is being exercised. The placement ofthe ad/coupon will be logged along with user profile information andprovided to the advertiser/couponer in some form of report.

During a user session or when a user completes a session, the user'slooking activity is analyzed for patterns, preferences and trends andthe profile annotated or updated so that when they next use theinformation aggregation and synthesization system, the nominatedsearches will be customized to their individual desires.

The six aspects of the information aggregation and synthesization systemare:

I. URL Munging

The World Wide Web ("WWW") is characterized by computer (user)connection through an Internet Service Provider to any WWW address orsite. Hence, use of the WWW is like placing individual telephone callsto many merchants, trying to compare products and services. URL Mungingis the process that allows the goods and services of many merchants tobe displayed through a single virtual shopping center.

This involves encapsulating and indexing the content of variousmerchants as well as modifying parts of the internal structure,repurposing and redirecting it to be integrated into the informationaggregation and synthesization process. This allows content from andaccess to multiple merchants to be aggregated, synthesized and accessedat a single WWW site.

II. WWW CD-Rom

World Wide Web ("WWW") access from homes is often constrained by thelack of sufficient data communications bandwidth within a typicalresidential infrastructure (WWW information may be accessed through theInternet WWW, a local Internet WWW, or a local datastore or cachesimulating a WWW component).

The Internet user will select World Wide Web (WWW) content for retrievalusing a search engine to return selected WWW references. The user willthen select certain of these references to be included in a custom CDwhich will be burned or recorded onto a CD and then sent by expressdelivery to the user.

III. Software Agent Advertising Insertion

Currently, advertisements in WWW pages are tightly tied to each page,are inserted based on keywords or on a psychographic profile of theuser.

Certain criteria will be entered which delineates a pattern that isrequested to be monitored. When this pattern is seen (or is in closematch) in the user's WWW activity, the insertion mechanism is activated.If a certain web page is requested, the present invention will display aparticular advertisement. The ad will be inserted based on the contentof the existing web page being read. An analysis of the text stream ofthe user's interactive session will be performed on-line. For instance,if the user accesses web pages for Holiday Inns on the West Coast, theinsertion mechanism could be established to automatically insert ads forHilton Inns on the West Coast.

IV. Automated Profile Generation

Presently, user's profiles are collected based on explicit entry by theuser, and extraction from demographic data collected from a variety ofsources.

In the present invention, the searching patterns of the user on theInternet are monitored. A set of software text agent profiles isdeveloped and may be integrated with explicitly collected profileinformation. The automated profile generation will have both explicitprofile information gathering and implicit profile information gatheringcapabilities.

As the user uses the information aggregation and synthesization system,the pattern of information being viewed is analyzed. During a user'ssession, advanced text analysis tools are used in real-time tounderstand the interests of the user by synthesis of the text stream ofpages looked at. This synthesis is used as input to a statisticalcorrelation with similar interests of a larger population. The resultsof this correlation are used to predict the extended interests of theuser. These are matched using intelligent software text agents and avariety of reasoning techniques. The user is presented with search ideasas well as promotions and specials from suppliers based on thesesearching patterns.

V. Automated Lead Generation Currently, leads are generated by recordinguser's WWW site selection. (For Example, user's visiting a "Chicago"information site would be "Chicago" leads.)

In the present invention, the user WWW viewing patterns are recorded.These and optionally the user's profile are matched against softwaretext agents entered by a supplier. When these agents match apattern/profile, the supplier is notified. When this profile isapproximately matched, the supplier is notified.

VI. Software Agent Unmet Needs Generation.

Currently, there is no on-line immediately accessible system to analyzeunmet needs of Internet users.

In the present invention, records will be maintained from user usage ofthe Internet on what consumer queries are unmet by the WWW contentretrieved. The invention will intuitively construct a profile from userinputted data. This will be done by recognizing unanswered queriesand/or user initiated requests. From this, a profile will be developedto identify new markets. As an example, if one hundred people inquireabout snorkeling off the coast of Texas, this information could be soldto a tour provider who could not only prepare a travel package but sellthe leads to a company. Thus, the system will be able to gather"negative" leads.

In the course of a session, the user may desire information not yetavailable. This information could be in the form of a product, a serviceor an event. The user then can establish a persistent (stays aroundafter the user's session is over) complex software text search agent tomonitor future information additions to the System and alert the userthrough a variety of means (facsimile, electronic mail, text page,voice, pager) that the information that was requested is available or insome instances, provide the information directly. The set of persistentagents will also be analyzed by the information aggregation andsynthesization system operators and provided to potential suppliers whowould in turn develop new product offerings which would be added to theinformation aggregation and synthesization system sources.

BRIEF DESCRIPTION OF THE DRAWINGS

The accompanying drawings, which are incorporated in and constitute apart of the specification, illustrate presently preferred embodiments ofthe invention and, together with the preceding general description andthe following detailed description, explain the principles of theinvention. In the drawings:

FIG. 1 illustrates an interface of the present system with a user accesssystem and various data sources;

FIG. 1A illustrates a modified arrangement of the interface of presentsystem with alternate user access systems and alternate networkinterface devices;

FIG. 1B illustrates a limited bandwidth limited character set subsystemconsistent with the present invention.

FIG. 2 illustrates several datastore categories and an I/O systemconsistent with the present invention;

FIG. 3 illustrates dialog management and agent datastore categoriesconsistent with the present invention;

FIG. 4 illustrates operation systems categories consistent with thepresent invention;

FIG. 5 illustrates a flow diagram for a WWW CD ROM consistent with thepresent invention;

FIG. 6 illustrates a flow diagram for a software agent advertisinginsertion consistent with the present invention;

FIG. 7 illustrates a flow diagram for automated profile generationconsistent with the present invention;

FIG. 8 illustrates a flow diagram of lead generation consistent with thepresent invention; and

FIG. 9 illustrates a flow diagram for an unmet need agent consistentwith the present invention.

DETAILED DESCRIPTION

In the embodiments described herein and accompanying figures, a travelinformation scenario is depicted. It will be understood that the presentinvention is capable of performing similarly for other venues, such asmortgages, automobile sales and any other interactive exchange ofinformation sought by information content seekers and potentiallysatisfied by information content providers.

Initial Setup For User

Referring to the drawings in detail, FIG. 1 illustrates a diagramshowing the interface of the present system 200 with a user on a useraccess system 100 and various data sources. FIG. 2 illustrates severalof the datastore categories. The use of the present invention has atleast five phases:

Initial Setup For User

Initial Setup For Advertisers and Lead Generation

Ongoing Maintenance

User Session

Post Session Activity

A theme or definition of a class of information (e.g., centralCalifornia travel and tourism or new automobiles) is identified. Datasources (Local DataStores (500 . . . N) and Network AccessibleDataStores (300 . . . N)) are screened for relevance, quality ofinformation and appropriateness (or may be included de facto based ontheir title or description). These are indexed using a text indexingsoftware tool 2981 and the indices stored on the system index DataStore220. An initial set of Preestablished Software Text Agents are defined.These agents are words or combinations of words that form a word basedsearch pattern. This initial set of agents is relevant to the searchesthat might be performed against the class of information that wasindexed. (i.e., Agents about automobiles would be developed to search aclass of indexed information about new cars). These are stored in thePreestablished Software Text Agent DataStore 231. The System 200 usesany multipurpose computer central processing units with the ability tohandle multiple inputs and outputs with the necessary hard disk storageand to run World Wide Web (WWW) or other network server software.

FIG. 1A illustrates a modified arrangement of the interface of thepresent system 200 with alternate user access systems and alternatenetwork interface devices.

The present system 200 is in communication with a limited band widthlimited character set system (LBLCS) 289 which is a subsystem ofinput/output system 280.

Although today's WWW access is normally with broad band, high speednetworks, many corporate intranets operate on limited capability, slowspeed networks. The LBLCS system 289 allows conversion of the rich mediaused on today's WWW into text-only media with multi-media references asanchors that preserve the essential information to be passed in HTML orother tagged reference format to the user. For users with limited bandwidth limited character set networks, the WWW datastore informationwhich is returned to the user is altered. Any graphics files areidentified, eliminated and replaced with a text anchor. For example,certain networks or user access systems can not handle graphics files. Atext page which is returned to the user 110 or 120 which containsgraphic files will be identified. The graphic file itself will beeliminated and in its place a text reference, such as "(picture)", isinserted.

User access system 110 is connected through a limited private network tothe LBLCS 289 subsystem. FIG. 1b illustrates a block diagram of theLBLCS subsystem.

User interface system 120 illustrates a connection through a limiteddial network into the LBLCS subsystem 289.

The return datastream from the datastores to the user is modified to fitthe bandwidth, character set and display limitations of the network andof the user access device.

In one implementation of the present system, terminals for travel agentsmay be provided with access to the system 20. In certain cases, travelagent terminals are much more limited than ordinary personal computerCPU's. Through the usage of present invention, agents will be providedaccess to the information aggregation and synthesization system 200.

Initial Setup For Advertisers and Lead Generation

Advertisers

Advertisers, using a user access system 100 enter criteria that shouldbe met for an advertisement/coupon placement. These criteria are in theform of the complex software text search agents described above. Thisincludes a match "threshold". When this threshold is met or exceeded, anad/coupon will be appended to a system session. Statistical analysisknown as clustering is used to evaluate the data.

The ad/coupon may be resident on the user access system 100, anadvertiser's computer system (400 . . . N) or stored in the AdvertisingDataStore 250. Additionally, the Advertiser may include conditionalcriteria for ad/coupon placement (available inventory, in stock levels,excess capacity, etc.). This criteria is referenced when the "threshold"is met and if satisfactory, the ad/coupon is appended. This criteria maybe tested against data input through the user access system 100, data onthe advertising DataStore 250 or data on the advertiser's computersystem (400 . . . N). Additionally, advertisers can input World Wide Web(WWW) referential information (hot links) to be displayed withads/coupons or on geographic map displays. These are stored on theadvertising DataStore 250.

Lead Generation

Lead Purchasers, using a user access system 100 enter criteria thatshould be met for the generation of a lead. These criteria are in theform of the complex software text search agents described above. Thisincludes a match "threshold". When this threshold is met or exceeded,information about the current user and the information being viewed isstored in the lead DataStore 270 for variable output transmission to thelead purchaser.

Ongoing Maintenance

Index Updating

Local DataStores (500 . . . N) and network accessible DataStores (300 .. . N) will change randomly and will become out of synchronization withthe system index DataStore 220. The data monitoring system 2982 willperiodically monitor local DataStores (500 . . . N) and networkaccessible DataStores (300 . . . N) and when there is a change, updatethe index DataStore 220.

Data Addition

Operators will add data to the local DataStores (500 . . . N) and usersusing a user access system 100 will nominate data from the networkaccessible DataStores (300 . . . N) to be added to the index DataStore220. Operators will update the indices using the data indexing service2981 if the data passes the screening outlined in the initial setup forusers above.

User Session

Login and Profiles

Browsing

Data Retrieval

User Interrupt

Ad/Coupon Insertion

Persistent Agents

Login and Profiles

Users using a user access system 100 access the information aggregationand synthesization system 200 through the Internet or other public orprivate network. The user either logs in by name or by pseudonym or fromdata previously stored in the user access system 100. New users createan account on the user profile DataStore 210. Previous users areidentified to an existing account. The user is presented with a varietyof options to create or update profile information in the user profileDataStore 210. This involves a single data entry option or manymini-options based on the browsing activity.

Browsing

The user is also presented with browsing options based on: activity froma previous session in the browsing activity DataStore 240; predevelopedsoftware text agents and personalized software text agents (developed inthe Post Session Activity) stored in the Personal Search Text AgentDataStore 232; or combinations of all as well as situationalopportunities developed by the user greeting subsystem 291. The userselects the search options to be used (or simply enters search criteriadirectly). This search criteria is used to search the index DataStore220 and a list of data sources is presented to the user for selection.The user indicates the information to be viewed. The user will also bepresented with options to refine his search through the altering ofsearch agent criteria (Search Reduction System 293).

Data Retrieval

The requested data is retrieved either from local DataStores (500 . . .N) or network accessible DataStore (300 . . . N) and presented to theuser via the session management system 292. The user may jump to datareferenced in the presented data. Subject to the appropriate policies ofthe site operation, the session management system 292 will furtherretrieve and present this data to the user. The user may request thatdata be overlaid on a geographic display using the Geographic DisplayI/O System 287 so that referenced information may have geographicrelevance.

User Interrupt

The user interrupt system 294 will periodically notify the user ofspecialized software text agents that they may want to pursue. TheseAgents are stored in the agent DataStore 230 and are derived by the realtime session analysis system 295 which monitors the browsing activityDataStore 240 during the user's session.

Ad/Coupon Insertion

During the session, ads/coupons are inserted alongside displayed data(text, picture or index displays) from the ad DataStore 250, based onad/coupon insertion agents 233 and inserted by the session managementsystem 292. A Record of Insertion along with appropriate userinformation (may be general or precise to the name of the user) isstored in the advertising activity DataStore 260.

Persistent Agents

At any time, the user may establish a persistent software Text Agent(using the persistent agent entry system 297, stored in the unmet needsagent DataStore 234) with criteria, if met sometime in the future, willcause the user to be notified through the I/O System 280. These can beexplicit or implicit query parameters.

Post Session Activity

Periodically, either due to a preset time interrupt, or user oradvertiser event driven activity, the following can occur:

Unmet Needs Analysis

Advertising Report

Profile Updating

Lead Report

Targeted Output

Output Activity

Unmet Needs Analysis

Users using the user access system 100 will be able to establishpersistent (stays in the system after the user quits using the system)software text agents which describe some criteria, which, if met, willcause them to be notified. These are stored in the unmet needs agentDataStore 234. These unmet needs agents 234 are analyzed using the unmetneeds analysis system 299 and reports are created through the I/O System280 for suppliers who could potentially meet those needs.

Advertising Report

Information about each Ad/Coupon appended to an information aggregationand synthesization system along with known information about the user isstored in the advertising activity DataStore 260. This is reported outperiodically to the advertisers/couponers using the I/O System 280.

Profile Updating

During a session or after a user discontinues use, the data viewed(recorded in the browsing activity DataStore 240) is analyzed by thesession profile update 2921 and the user profile DataStore 210 isupdated with keywords or personal search text agent DataStore 232.

Lead Report

Periodically, the Software Text Lead Agents stored in the leadgeneration agent DataStore 235 are used to analyze the data viewed(recorded in the browsing activity DataStore 240) and reports preparedfor lead purchasers using the I/O System 280.

Targeted Output

Users through the user input system 100 will be able to designateinformation to be output and the format that the I/O System 280 willuse.

Output Activity (Using the I/O System 280)

All output systems will provide for the addition of specials, ads and/orcoupons.

Options are

Personalized Page 281--This will create a page accessible through theWWW where the user can access requested information.

SMTP Electronic Mail 282--This allows the delivery of user requestedinformation using the SMTP capability of the Internet and other popularelectronic mail systems.

CCITT Class 3 or Class 4 Facsimile 283--This allows user requested datato be formed as a printed page and sent via Fax to a Fax receiver of theuser's choice.

Voice output direct or to a Voice Mail Box 284--This translates the userrequested data to audio, connects to the user or their voice mail systemand transmits the audio.

Personal TV or video feed 285--This formats the data in a formcompatible with transmitted video and allows viewing on demand.

Custom CD-ROM 286--This places the requested data, indices, viewers andall necessary software on a user Unique CD-ROM for physical delivery.

Geographic Display I/O System 287--This allows the user to view contentgeographically, to look at the geographic proximity of merchants andservices and provides a vehicle for ads and hot links.

Mobile/Portable System 288--This allows Specially formatted GenieInformation to be displayed or translated for a wide variety of mobileand portable devices.

Identification of Key System Components by Reference Numerals

100 User Access System

110 Limited private network user access system

120 Limited dial network user access system

200 System comprised of

210 User Profile DataStore

220 Travel Genie Index DataStore

230 Agent DataStore

231 Preestablished Software Text Agents

232 Personal Search Text Agents

233 Ad/Coupon Insertion Agents

234 Unmet Need Agents

235 Lead Generation Agents

240 Browsing Activity DataStore

250 Advertising DataStore

260 Advertising Activity DataStore

270 Lead DataStore

280 I/O System

281 Personalized Page Output System

282 SMTP Electronic Mail System

283 CCITT Class 3 or Class 4 Facsimile

284 Voice Output

285 Personal TV or Video Feed

286 Custom CD-ROM

287 Geographic Display I/O System

288 Mobile/Portable Device System

289 Limited Bandwidth Limited Character Set System

290 Operations System

291 User Greeting System

292 Travel Genie Session Management System

2921 Session Profile Update

293 Search Reduction System

294 User Interrupt System

295 Real Time Session Analysis System

296 Ad/Coupon Insertion System

297 Persistent Agent Entry System

298 Data Support Systems

2981 Data Indexing Service

2982 Data Monitoring System

299 Unmet Needs Analysis System

300 Network Accessible DataStores

301 . . . N

400 Advertiser's Computer Systems

401 . . . N

500 Local DataStores

501 . . . N

100 User Access System

This is a network addressable interface device, such as a conventionalpersonal computer capable of initiating and maintaining a networkconnection and sending, receiving and displaying data including adigitized data visual representation device such as a monitor andauxiliary storage, such as a floppy disk drive. It may also be a TV set,smart telephone or network appliance with similar capabilities. It willmaintain a connection through a modem (a modulator/demodulator device)that enables data to be transmitted and received.

200 DataStores

FIG. 2 illustrates DataStores utilized as a part of the invention. Theinformation aggregation and synthesization system includes:

210 User Profile DataStore

This contains data about the user, preferences, situational preferences,accounting information, psychographic profile, personal profile andother relevant information related to the user by individual identifier.

220 System Index DataStore

This is the index of data accessible by the system.

It is established initially and updated as data changes or new datasources are added. It is queried by Agents from the Agent DataStore 230or by key words.

230 Agent DataStore

231 Preestablished Software Text Agents

These are complex software text search patterns predefined by the sitesubject matter experts using their extensive knowledge of informationcontained within the site's indices.

232 Personal Search Text Agents

These are complex software text search patterns that may be individualwords or word sets and/or combinations of words and PreestablishedSoftware Text Agents 231 including the results of the post sessionanalysis 2921 that provide individually customized searching of theIndex DataStore 220.

233 Ad/Coupon Insertion Agents

These are complex software text search patterns that when matched withinthe text being reviewed within a given session, cause anadvertisement/coupon to be added into the display. These can be directinsertion or conditioned from criteria on the Advertiser's ComputerSystems (400 . . . N) and/or the user's profile from the user profileDataStore 210

234 Unmet Need Agents

These are complex software text search patterns created by the user topersist after the end of the user session looking for patterns and/orspecific events or data that are observed within the System 200 at somefuture time.

235 Lead Generation Agents

These are complex software text search patterns that when matched withinthe text being reviewed within a given session, causes an addition tothe Lead DataStore 270 for output to the lead purchaser using the I/OSystem 280.

240 Browsing Activity DataStore

This is the record of the "looking" activity of each user in eachsession.

250 Advertising DataStore

This is the storehouse of ads to be presented when a match is made bythe Ad/Coupon Insertion Agent 233

260 Advertising Activity DataStore

This is the record or ads presented by the Ad/Coupon Insertion System296 and information about the user seeing the ads from the BrowsingActivity DataStore 240 and the user profile DataStore 210

270 Lead DataStore

When a Lead Generation Agent 235 makes a match, Data about the user fromthe user profile DataStore 210 and the Browsing Activity DataStore 240is stored here.

280 I/O System

These are the various ways that output can be channeled, for the user,the advertiser or the lead purchaser.

281 Personalized Page Output System

This allows output text and associated objects to be formatted forgeneral or selective viewing through any system using Hypertext MarkupLanguage (HTML), VRML (Virtual Reality Modeling Language) or othernetwork compatible display based language either locally or over anetwork.

282 SMTP Electronic Mail System

This allows output text for whatever purpose to be formatted in a formatcompatible with the SMTP (Simple Mail Transport Protocol) andtransmitted to a designated addressee.

283 CCITT Class 3 or Class 4 Facsimile

This allows output text and associated objects for whatever purpose tobe formatted to be compatible with the CCITT Class 3 or Class 4 Faxstandard and transmitted to a designated fax receiver.

284 Voice Output

This allows output text for whatever purpose to be formatted into voicefor transmission to a human receiver or a voice mail box.

285 Personal TV or Video Feed

This allows output text and associated objects for whatever purpose tobe formatted as a TV signal (any international standard) to be accessedand replayed using local or network capability at the request of anindividual user (or a class of users).

286 Custom CD-ROM

This allows the user to designate certain data to be placed onto aCD-ROM along with all necessary search and viewing software as well asnon user requested ads and promotions.

287 Geographic Display I/O System

This allows data requested by the user to be overlaid on a geographicreference system (a map).

288 Mobile Device System

This allows output to be formatted for a variety of devices includingbut not limited to: pagers, personal digital assistants, mobilecomputing devices and other wireless devices.

289 Limited Bandwidth, Limited Character Set (LBLCS) Data Network

The software module input/output system identifies graphic files,removes them and replaces them with text anchors. The LBLCS module maybe resident on the I/O system 280 or be established on separatehardware.

290 Operations System

291 User Greeting System

This is the subsystem that identifies users, customizes search screens,incrementally collects explicit profile information and formulatessearch agent screens and search specials which may be situational orseasonal or both.

292 Session Management System

This tracks and records a user's browsing activity, sets ID tokens,establishes accounts, translates anonymous users to named users andmanages the user's implicit profile information.

2921 Session Profile Update

Uses the Browsing Activity DataStore 240 records, to analyze and updatethe user's profile in the user profile DataStore 210

293 Search Reduction System

This aids the search by suggesting changes to the complex software textsearch agents to refine the user's search.

294 User Interrupt System

Based on the Real Time Session Analysis 295 of the users lookingactivity (stored in 240), determines associated references, agents orother information to be offered to the user and interrupts the user'ssession with an interactive data screen.

295 Time Session Analysis System

This monitors the user's browsing activity and analyzes the apparentinterests to trigger the user interrupt system 294.

296 Ad/Coupon Insertion System

This looks at the current display requested by the user with a Ad/CouponInsertion Agent 233, determines which ads should be placed (or rotated)and makes the placement (or establishes the rotation).

297 Persistent Agent Entry System

This is the mechanism whereby the user enters the Unmet Need Agent 234.This agent monitors text and data changes and if the requesteddata/pattern occurs, the user is notified via the I/O System 280.

298 Data Support Systems

2981 Data Indexing Service

This is the facility that indexes designated DataStores (either NetworkAccessible DataStore (300 . . . N) or Local DataStores (500 . . . N)upon operator input or periodically and stores these indices in theIndex DataStore 220.

2982 Data Monitoring System

This facility, periodically or on demand, checks indices stored in theIndex DataStore 220 against actual data (either Network AccessibleDataStore (300 . . . N) or Local DataStores (500 . . . N)) and if it haschanged, queues for operator review or updates indices.

299 Unmet Needs Analysis System

This analyzes the persistent agents for common patterns or specificrequests that can be custom tailored. The results are outputted throughthe I/O System 280.

300 Network Accessible DataStores

301 . . . N

These are an infinite number of network data sources that are includedin the scope of the information aggregation and synthesization. Theseare represented by

(300 . . . N)

400 Advertiser's Computer Systems

401 . . . N

These are DataStores established by advertisers to store ads/coupons tobe presented or to set additional conditions for display.

500 Local DataStores

501 . . . N

These are similar to the 300 series but locally vs. wide area networkaccessible.

Each of the six aspects of the present invention will be discussed indetail.

I. URL Munging

The present invention becomes a gateway to network data content providedby others. The present invention directs access which is controlledthrough an intermediary gateway system.

The user, through a network addressable interface device such as theuser access system 100, will connect with a local or network accessibleDataStore. The user will select a page (designated by a Uniform ResourceLocator or URL) to be used. The URL will be modified or "munged" so thatretrieval must go through the present invention when the user executes aretrieval request. This then permits return of requested data to theuser from the DataStore, at all times passing through the presentinvention 200.

The URLs embedded in each page that pass through are indexed by thepresent invention or "munged" so that any hyper linking to another WWWsite always goes through the present invention. As an example,"WWW.anywhere.com" is converted to "WWW.travelgenie.com?WWW.anywhere.com", even though the user will see a direct path to thedistant site.

Accordingly, when the user clicks on a URL (or types it in a browser'ssearch request), the user will connect to the requested site through thesystem 200.

The present invention may be utilized with a wide variety of networkaddressable interface devices. When the invention is utilized on alimited bandwidth, limited character set data network, the datastreamreturned to the user will pass through the LBLCS network 289. Thedatastream is modified to fit the bandwidth, character set and displaylimitations of the network and the limitations of the user accessdevice.

II. WWW--CD ROMS

The user of a network addressable interface device will select WorldWide Web (WWW) data content for retrieval using a search engine toreturn selected WWW references. The user will then select and designatecertain of these references to be included in a custom CD-ROM which willbe burned or recorded onto a compact disc and then sent by expressdelivery to the user.

The user will designate pages and other WWW data content including butnot limited to HTML files, audio files, still images and other graphicfiles from the WWW. Through the session management system 292, selectedmaterial will be designated and retrieved. The retrieved data will beincluded in a custom CD-ROM produced by a service bureau and then sentby a delivery service to the user. FIG. 5 shows a process flow diagram.

Optionally, the designated data may be communicated to the user viaautomated telephone means, may be communicated to a user via electronicreplication, or may be copied on to auxiliary computer storage such asthrough a floppy disk drive.

III. Software Agent Advertising Information

Advertising is provided which benefits the user while optimizing theadvertiser's expenditure by only presenting ads or coupons (or ads andcoupons in a rotation if multiple ads/coupons qualify) that arepertinent to that particular user.

Certain criteria will be entered which delineates a pattern that isrequested to be monitored. When this pattern is seen (or is in closematch) in the user's WWW activity, the insertion mechanism is activated.If a certain web page is requested, the present invention will display aparticular advertisement. The ad will be inserted based on the contentof the existing web page being read. An analysis of the text stream ofthe user's interactive session will be performed online. When certaintext patterns are observed (or close matches are observed), anadvertisement is inserted into the display.

The advertising may be static or connected to the advertiser's computerDataStore which designates specific ads or coupons based on the patternmatch and other conditions which may be required.

FIG. 6 illustrates a flow diagram for the software agent advertisinginsertion.

The software agent criteria is entered by the merchant in the agent datastore 230 which delineates a pattern that needs to be monitored.

As an example, if the user accesses web pages for "Holiday Inns on theWest Coast", the insertion mechanism would be established toautomatically insert ads for "Hilton Inns on the West Coast".

IV. Automated Profile Generation

Browsing patterns of the user are analyzed and these patterns updateprofiles automatically.

FIG. 7 illustrates a flow diagram for the Automated Profile Generation.

The looking patterns of the user are monitored to develop a set ofsoftware text agent profiles that are integrated with explicitlycollected profile information to assist the user in narrowing downinformation for future sessions as well as suggesting references,merchandise or services during the current session. This is accomplishedby statistical analysis of the text stream.

The searching patterns of the user on the Internet are monitored bymonitoring the text stream. A set of software text agent profiles isdeveloped and may be integrated with explicitly collected profileinformation. The explicit information is gathered by queries to theuser. The explicit and implicit data are merged to develop software textagents that support the user's future shopping sessions.

During a user's session, advanced text analysis tools are used in realtime to understand the interests of the user by synthesis of the textpages looked at. This synthesis is used as input for statisticalcorrelation with similar interests of a larger population. The resultsof this correlation are used to predict extended interests of the user.These are matched using intelligent software text agents and a varietyof reasoning techniques including case based reasoning and fuzzy logicto establish a recommended list of search ideas, promotions andspecials. The use of collaborative filtering may also be employed. As anexample, if the text analysis indicates that the user has looked atdownhill and cross-country skiing, past usages from a larger populationmay indicate that the user will also be interested in ice skating.

As seen in FIG. 7, real time analysis of data is illustrated at box 295.The real time session analysis is in communication with the userinterrupt system 294 so that the session may be interrupted at anappropriate point. At the same time, a post session profile update 2921will update profiles based on browsing activity from a past session andthereafter stored in user profile DataStore 210.

V. Automated Lead Generation

It is known that suppliers will pay for information gathered aboutuser's specific interests. When tied to a specific user, these become"leads" that a supplier can use for off-line follow up. The automatedlead generation aspect will analyze a user's profile and session lookingactivity against a profile established by a supplier. When this profileis approximately matched, the supplier is notified so it can contact theuser to offer goods or services. Statistical analysis using complexsoftware text agents is used to determine the match.

FIG. 8 illustrates a flow diagram of the lead generation.

In the present invention, the user's WWW viewing patterns are monitored.These and optionally the user's profile 210 are matched against softwaretext agents entered by a supplier in an agent DataStore 230. When theseagents match a pattern or profile, the supplier is notified.Additionally, when this profile is approximately matched, the supplieris notified. Lead purchasers, using a user access system 100, will entercriteria that should be met for the generation of a lead. These criteriaare in the form of complex software text search agents. When thisthreshold is met or exceeded, information is stored in the leadDataStore 270 for variable output transmission to a lead purchaser.

VI. Software Agent Unmet Needs Generation

In the present invention, records will be maintained from user usage ofthe Internet and other networks on what consumer queries are unmet bythe WWW content retrieved.

FIG. 9 illustrates a flow diagram.

If the user does not find what they are looking for, a "watcher" agentmay be set up to advise them if the object of their search occurs atsome future time. An example would be a tour, a price or some otherinformation. Through the session management system 292 a threshold willbe established on the user need.

The invention will intuitively construct a profile from user inputteddata. This will be done by recognizing unmet or unanswered queriesand/or user initiated requests. From this, a profile will be developedto identify new markets. The system will thus be able to gather"negative" leads. This information may be extracted and sold tosuppliers who will build new products and services and then use thesystem as a mechanism to notify the potential customer.

Whereas, the present invention has been described in relation to thedrawings attached hereto, it should be understood that other and furthermodifications, apart from those shown or suggested herein, may be madewithin the spirit and scope of this invention.

What is claimed:
 1. An information aggregation and synthesizationprocess to retrieve information in a network in which a user operates anetwork addressable device, which process comprises:communicatingbetween said network addressable device and at least one networkaccessible datastore through network addressing means; analyzing of areturned text stream from said network datastore; and retrieval from anadvertising datastore and insertion of advertising or product discountinformation in the text stream based upon a threshold matching of apredetermined criteria and said text stream analysis.
 2. An informationaggregation and synthesization process as set forth in claim 1 whereinsaid network specific addressing means includes Uniform ResourceLocators (URLs).
 3. An information aggregation and synthesizationprocess as set forth in claim 1 wherein said analyzing is performedthrough an intermediary gateway system.
 4. An information andaggregation and synthesization process as set forth in claim 1 includingthe additional step of identifying graphic material in data returnedfrom said datastores and replacing said graphic material with a textanchor.
 5. An information aggregation and synthesization process toretrieve information in a network in which a user operates a networkaddressable interface device, which process comprises:communicatingbetween said network addressable interface device and at least onenetwork datastore through network addressing means; accessing a textstream passing between said network addressable interface device andsaid network datastore by an intermediary gateway system; retrieval ofadvertising or product discount information from an advertisingdatastore based upon a threshold matching of predetermined criteria andthe text stream analysis and insertion of the advertising or productdiscount information in said text stream; gathering of explicitinformation from said user and gathering of implicit information todevelop a user profile; providing information about said user to a leadpurchaser; and providing information to a third party to meet needsidentified.
 6. An information aggregation and synthesization process asset forth in claim 5 wherein said network specific addressing meansincludes Uniform Resource Locators (URLs).
 7. An information aggregationand synthesization process as set forth in claim 5 wherein saidanalyzing is performed through an intermediary gateway system.